Surpassing the 2012 XENON100 exclusion limit with 2011 data and information theory 
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Tremendous progress in the field of dark matter direct detection has allowed modern experiments to reach the 
sensitivity that is required to exclude large regions of parameter space. The strongest exclusion limit has been set 
by the XENON 100 collaboration which has recently updated the constraint on the dark matter parameter space 
to an even greater degree. The method used by the collaboration to set this limit is based on a profile Likelihood 
analysis, which separates the experimental data into bands to better discriminate a potential dark matter nuclear- 
recoil signal from the electronic-recoil background. However the use of bands is not strictly necessary and tends 
to make the limit over-conservative. Here we propose an alternative method based on information theory and 
apply it to the 201 1 data-set from the XENON 100 experiment. We derive a new exclusion limit with this data 
which is both stronger than that derived by XENON 100 with their profile Likelihood analysis using the same 
data, and competitive with the new limit which was derived using the recent 2012 data-set. 



INTRODUCTION 

Despite strong gravitational evidence for the existence of dark 
matter (DM) in the Universe (from galactic to cluster scales) 
its nature remains largely a mystery. Yet great progress has 
been made in this endeavour: direct detection experiments, 
based on the detection of dark matter induced nuclear recoils 
have set progressively stronger limits on the properties of dark 
matter (HE), particularly for masses in the GeV to TeV range. 

The current world-record limit on the spin-independent 
cross section for dark matter elastic scattering with nuclei has 
been set by the XENON100 experiment [3-5 1, which operates 
using both liquid and gaseous xenon with a fiducial volume of 
~ 40kg. The XENON 100 detector identifies events by two 
separate signals [6[: primary (SI) and secondary (S2) scintil- 
lation, the former of which is due to scintillation light originat- 
ing from the liquid part of the detector, while the latter comes 
from ionised electrons, which drift to the gaseous part of the 
detector under an electric field. The precision of the detector 
in identifying and excluding dark matter interactions crucially 
relies on its ability to discriminate the electronic -recoil (ER) 
background from nuclear-recoil events (NR), such as those 
potentially induced by Weakly Interacting Massive Particles 
(WIMPs), based on their different distributions in SI and S2. 

In order to derive limits on the spin-independent cross sec- 
tion as a function of dark matter mass, the XENON100 col- 
laboration employs a profile Likelihood approach [7 8 1. Such 
a method takes advantage of the distinct signatures in S1-S2 
of electronic and nuclear recoils by splitting the data-space 
into a number of bands (23 in [8| and 12 in [7|), which al- 
lows stronger limits to be placed on the WIMP-nucleon cross 
section than would otherwise be possible. 

Here we point out that in addition to problems associated 
with systematic uncertainties [9, 10], dividing the S1-S2 space 
into bands could potentially limit the sensitivity of the analy- 
sis, as discrimination is not possible between events within 
the same band. Hence despite the great achievements of 
the XENON 100 collaboration in setting limits on the WIMP- 
nucleon cross section, one can try to improve the analytical 
method, to exploit the full sensitivity of the experiment. 



In this letter we propose a Bayesian approach, based on 
examining the information Hamiltonian. We will demonstrate 
using the XENON100 data-set from 100 live days of running 
[4 1 that not only is this method consistent with the previous 
results obtained by the use of a profile Likelihood analysis 
El SI [8 ] and by a Bayesian analysis performed in [11 J, but it 
also leads to an exclusion limit which is stronger with respect 
to J4] [8] [TT] and competitive with Q where new data is used, 
due to the use of the full data-space. The analysis using the 
method presented here of the newest XENON 100 data (3) is 
left to the collaboration themselves. 



INFORMATION THEORY 

The method presented here uses all information present 
in the full data-space (a requirement of information theory 
lfL2ll ) in a probabilistic manner, while keeping cuts and coarse- 
graining to a minimum. It is defined within the S1-S2 data- 
space Q. which is parameterised by x = (S1,S2). We seek to 
determine to what degree a signal can be observed within the 
data above background. One can identify two relevant func- 
tions to be taken as input for the analysis: the first is the actual 
experimental data, defined as a set of N discrete (S 1 ,S2) points 
labelled as xf ata = (Sl? ata ,S2f ata ), where i runs from 1 to N. 
The second is the expected distribution of events X(x) which 
constitutes the theoretical expectation of both the background 
and possible signal at any point x in the data-space. The ex- 
perimental data can be thought of as a discrete sample of the 
theoretical distribution X(x). 

We model the data-space Q. by pixelating it into n two- 
dimensional bins of equal size in S1-S2 given by Ax and la- 
belled with the index j. Within a pixel j at position Xj — 
(Slj,S2j) in the S1-S2 plane there will be a certain number 
of experimental data-points jc data . This number will be defined 
to be tij. For the same pixel, the theoretically expected num- 
ber of points is given by Xj = X(xj)Ax. Hence we can define a 
Poisson likelihood for all pixels over the whole data-space, by 
considering Xj as the mean expectation value of the number 
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FIG. 1: XENON 100 data from 100 live days of running [4| (black dots), the expected distribution of background (red) and 
WIMP events (blue), with a contour spacing of 10% (12.5%) of the maximum for signal (background). The signals for the two 
masses are clearly distinct due to their different recoil spectra. Note that logio(S2/Sl) has been used in place of S2 for clarity. 



of points Hi in each pixel j. Such a Likelihood is given by, 



*ew = n -v 

j=\ n j- 



(i) 



Here d represents the data and s represents the expected con- 
tribution from a possible recoil signal in X(x). We can separate 
X(x) into its constituent signal /nr(x) and electronic-recoil 
background b£R(x) parts as X(x) — fNR(x)e s + bE R (x), where 
s is given in terms of the spin-independent elastic WIMP- 
nucleon cross-section a as s = In/ = ln(a/ao) (c?o is an ar- 
bitrary reference cross-section). Note we assume that ERs 
are the dominant source of background, and disregard any 
neutron-induced background, as is done for the analysis used 
by XENON 100 |8|. Dropping this assumption with a well- 
motivated model could potentially improve the accuracy of 
our analysis in setting limits. 

We must also define a prior for the parameter s. We have no 
theoretical prejudice on its value and therefore consider a flat 
prior i.e. assign to all possible log-cross section values s the 
same a priori probability density function fP(s) = const. Since 
it extends over an infinite range fP(i) is an improper prior. One 
can alternatively assume that T(s) = 1/ (s ma x — ^min) for s £ 
[■Smin^max] and T{s) = otherwise. However, provided that 
[*min,*max] covers all interesting log-cross-sections, the results 
do not depend on the precise interval. This has been confirmed 
for the analysis of XENON 100 data using s m [ n = — 16 and 
s max = — 2 and directly integrating over the posterior volume. 
Hence we will use fP(s) = const throughout this letter. 

We can now combine the Likelihood (P(d\s) and prior 
!P(s) into the joint data and signal probability (P(d,s) = 
(P(d\s)(P(s). We will work with the information Hamiltonian, 



H = -ln2V, s) = £ (Xj - njlnXj) , 



(2) 



where = means that signal-independent terms have been 
dropped. The limit can now be taken where Ax —> 0, so that 



each pixel can only contain either 1 or data-points. In this 
limit the Hamiltonian becomes 
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where the 8-function picks out the positions of the N data- 
points xf ata . We define F — j n dxf^R{x), the total number 
of reference signal (nuclear-recoil) events in the data-space 
calculated at Gq. 

If the Hamiltonian possesses a classical minimum at s c i 
then d s H(d,s c i) = and there is a positive identification of 
signal in the experimental data. In this case the significance 
of such a discovery is given by the depth of the minimum. 

However, if such a classical minimum does not exist, one 
seeks to define an exclusion limit on the DM-nucleon cross- 
section (at a certain level of confidence). This can be done 
by finding the signal parameter si; m j t = ln(fi; m j t ) at which the 
Hamiltonian is increased by a value AH from H(d,s = — °°). 
We look for the value of t where the likelihood is reduced 
by an order of magnitude with respect to the case of no DM 



i.e. e 



-AH 



= 0.1. Such a limit is numerically similar to a fre- 
quentist limit at 90% confidence, though they are in principle 
distinct and have different interpretations. By rearranging eq. 
[3]one obtains an expression for the limiting cross-section, 
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where i sums over all data-points at positions x, and w, are 
data weights with Vf; = fNR(xi)/bER{xi). Eq. |4] contains two 
terms: the first AH /F is data-independent and gives the ab- 
solute limit in the case where no NR events are observed in 
the data. The second term accounts for potential NR events 
present in the data, and weakens the limit. We now pro- 
ceed to discuss how /nr(x) and bER(x) are determined for the 
XENON 100 experiment in order to obtain a limit. 



3 



SIGNAL DISTRIBUTION 
WIMP Recoil Spectrum 

Potential WIMP events are characterised by their energy 
dependent recoil spectrum parameterised as fT3l[l4l . 



Carlo sampling at a value of the reference cross-section Oo 



dR 
AE 



o(E) 
2m/A 



2 pr)(£',/), 



(5) 



where o(E) is the WIMP-nucleus cross-section as a func- 
tion of energy E, /j is the WIMP-nucleus reduced mass, 
p = 0.3 GeVcm~ 3 is the local dark matter density and r\(E) is 
the WIMP mean velocity, given by r\(E) = J" v °°. (£) /(v +" e) d 3 v. 
The mean velocity is integrated over the distribution of WIMP 
velocities in the galaxy f(v) boosted into the reference frame 
of the Earth by u e . The lower limit of the integration is 
v m i n (E), which is the minimum WIMP velocity required to in- 
duce a recoil of energy E. We assume the standard halo model 
such that /(v) is given by a Maxwell-Boltzmann distribution 
cut off at an escape velocity of v eS c = 544kms~ 1 . 

We assume that WIMPs interact identically with protons 

and neutrons giving o(E) =a (^j^j A 2 F 2 (E), where o is the 
zero-momentum WIMP-nucleon cross section, A is the atomic 
mass of xenon, /j p is the WIMP-proton reduced mass and 
F(E) is the nuclear form factor (taken to be of the Helm form 
1 13 1). Exclusion limits are set on a for each WIMP mass. 

Calculation of SI and S2 Signals 

At a given nuclear-recoil energy E the expected primary 
(Slexp) an d secondary (5 , 2 exp ) scintillation signals are obtained 
from the following formulae lfT0l[T5l[T6ll , 



Slexp = P[E-L eG (E)-L y 



S„ 



1 exp - ^ — — en \— , — v „ 

S2 exp = 20 -P(E-Q y (E)), 



(6) 
(7) 



where P(x) represents a Poisson distribution with expectation 
value x, Ly = 2.20±0.09PEkeV~ 1 , s f = 9.5/5.8 , L eS (E) is 
the relative scintillation efficiency and Q y (E) is the ionisation 
yield (we use the best fit functions from ifTBI here and also 
perform our own fit of L e ff to the NR band as in [ 15 1). 

To obtain the Sl bs an d S2 ti S signals observed in the de- 
tector, we must include the finite detector resolution and the 
20ns SI time-coincidence requirement J4j[7]]. The latter re- 
quires that all events in the XENON 100 detector have at least 
two SI signals within 20ns of each other, calculable using the 
probability distribution of scintillation emission as discussed 
in the appendix of [ 15 1. Both Sl exp and 52 exp are blurred with 
a gaussian of width 0.5 ^fn for n photoelectrons (PE) to take 
account of the finite photomultiplier (PMT) resolution. Addi- 
tionally 52 exp is subject to gaussian fluctuations for the finite 
size of a single PE, with width 0.58 per PE ifTBll . 

Dark matter signal in XENON100 

The expected signal distribution for a given WIMP mass 
in the data-space /nr(x) can now be calculated by Monte 



10 cm (or 10 cm for m < lOGeV). The energy range 
between 1 keV and 60keV is separated into bins of size AE 
().() I keV. For each binned energy E Tec we calculate Sl bs an d 
52 Q bs a total of N re c times, where N rec = gf (<5o,E rec )AE, to 
obtain the full signal distribution as expected in XENON 100. 

BACKGROUND DISTRIBUTION 

The expected distribution of electronic-recoil background 
events b£R (x) is determined from fits to 60 Co calibration data, 
as is done in Q]|8). Although electronic recoil appear mostly 
Gaussian distributed, the XENON 100 collaboration noticed 
the presence of an anomalous (non-Gaussian) background 
component [4]. This could be due to double-scatter gamma 
events, where only one of the gammas contributes to the S2 
signal. The distribution is normalised by the total number 
of expected background events, whose rate takes the constant 
value of 0.0061 counts per day per kg per keV ifTTl . 

RESULTS 

Using the expected distributions of signal /nr(x) and 
background b£R{x) as calculated in the previous sections 
and shown in fig [T] a limit has been placed on the spin- 
independent WIMP-nucleon cross section with 100 live days 
of XENON 100 data S by solving eq. |4]iteratively for WIMP 
masses between 6GeV to 400GeV. The result is shown as a 
solid red line in fig. |2]along with its one-sigma L e ff uncertainty 
and the limits obtained by the XENON 100 collaboration. 
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Limit from information theory 

and XENON100 data from 100 live days. 

L cff uncertainty from 
one-sigma fitting contours. 

XENON100 profile likelihood analysis 
using 100 Live Days of data (2011). 

XENON100 profile likelihood analysis 
using 225 Live Days of data (2012). 
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FIG. 2: Limit calculated using the method presented in this 
letter and 100 live days of XENON100 data H (solid red 
line) with uncertainties due to L e ff fitting as the shaded 
region. The 2011 [4] and 2012 [3] exclusion limits obtained 
by the XENON 100 collaboration are shown as a black 
dashed line and green dot-dashed line respectively. 

Clearly the use of this new analytical method has resulted 
in a sharp improvement in sensitivity over the profile Like- 
lihood method used by XENON 100, where we refer specif- 
ically to the analysis of XENON 100 using bands from here 
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on. In fact we obtain with this Bayesian method and the 201 1 
data the same exclusion curve as the XENON100 2012 limit 
[3 1, for WIMP mass m > 50GeV. At low mass the Bayesian 
method seems to be more powerful. This is due to the fact 
that the DM signature is very distinct from the electronic- 
recoil background for low-mass WIMPs, as displayed in fig. 
[T] Such discrimination at low-mass is not fully exploited by 
the XENON100 analysis, where the same weights (i.e. the 
expected ratio of signal /nr(x) to background &£r(x)) are de- 
fined solely using calibration data for all points within the 
same band. Hence information on the WIMP recoil spectrum 
is largely lost in the profile Likelihood method used by the 
XENON 100 collaboration. Additionally this highlights that 
when identifying points as potential signal or defining a sig- 
nal region care should be taken to emphasise that this is po- 
tentially dependent on the WIMP mass. 

For large WIMP masses the extra sensitivity gained can be 
attributed to the more accurate assignment of weight to the 
data. The bands used by the XENON 100 collaboration QUI 
ascribe too much weight to points which are actually consis- 
tent with the background model, hence resulting in the deriva- 
tion of an over-conservative limit. Our analysis also appears to 
benefit from an increased sensitivity relative to other Bayesian 
works such as ifTTI . where extensive cuts were made. These 
restricted the data-space to the region where S2 is smaller 
than the mean of the nuclear-recoil band and resulted in a 
loss of information and therefore exclusion sensitivity. Re- 
gions of data-space where signal could potentially be found 
are removed in addition to those where the background is ex- 
pected to be large (shown explicitly in ifTHl ). Such cuts were 
not made in deriving our exclusion limit, where all data above 
51 = 2PE and S2 = 300 PE are used. 

Our analysis possesses several uncertainties: we use the re- 
gion of data-space from SI = 2PE to 51 = 4PE, which was 
cut away in [4]. This has provided us with greater sensitiv- 
ity to low-mass WIMPs, but may have introduced some extra 
uncertainties due to the background in this region. Though 
under a variety of background models the low-mass limit re- 
mains robust. We determine L e ff (eq- [6) from fits to data, both 
direct measurements of L e ff lfT9l and fits to 241 AmBe calibra- 
tion data 03). Hence in addition to a best-fit function one can 
define contours representing one-sigma deviations consistent 
with the data J9] |20) . Uncertainties on the exclusion limit due 
to this one-sigma L e ff deviation are shown as the shaded re- 
gion in fig. [2] They remain of a similar magnitude to those 
present with the profile Likelihood approach using bands J9)> 
but are not shown by the XENON 100 collaboration (3]|4). A 
more extensive analysis is performed in 1 18 1, including a dis- 
cussion of the form of L en - used here and its uncertainties from 
fitting. We note that both the larger uncertainties at low-mass 
[9 1 and the extra sensitivity gained in this region are due to the 
recoil spectra being strongly peaked at lower energies. Hence 
although the uncertainties are still large at low mass, the ex- 
tra sensitivity compared to the profile Likelihood method of 
XENON 100 results in a limit which is still stronger, even with 
a rather extreme one-sigma deviation of L e ff from its best-fit. 



Astrophysical uncertainties ll2TH23l have not been dis- 
cussed here but remain important. 

CONCLUSIONS 

We have demonstrated that exploiting every pixel in (S 1 ,S2) 
space instead of using bands has resulted in an exclusion limit 
which can surpass the 2012 limit obtained by the XENON100 
collaboration Q. 

The application of the method presented here to the new 
2012 data is left to the XENON 100 collaboration, with the 
hope that they can use their expert knowledge of their detector 
and backgrounds to set an even stronger limit on the spin- 
independent WIMP-nucleon cross-section. 

We hope that the method presented in this letter is intuitive 
and powerful enough for XENON 100 and other experimental 
collaborations to employ. With a consistent analytical method 
used by all dark matter direct detection experiments, the cur- 
rent constraints on the WIMP cross-section should be both 
stronger and clearer. 
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